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(57) Abstract 

A method and apparatus are provided for compressing data 
received from a digitizer tablet based on the characteristics of each 
individual stroke contour (150, 160). The digitizer tablet samples 
the position of the writing pen, continuously transmitting data to a 
computer in the form of x and y coordinates plus an indicator of 
whether or not the pen is touching the surface of the tablet (100, 
1 10). After preprocessing, a determination is made of which points 
along the stroke contour are essential to the reconstruction of a 
quality facsimile of the original writing from the sampled points 
(140). This determination is based upon local curvature, local ex- 
treme, and the endpoints of the stroke. In an optional second stage 
of compression, a standard mathematical compression technique 
is applied (160). 
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BACKGROUND OF THE INVENTION 
The present invention relates generally to data 

capture applications, and more specif icallv to -hh^ ^ 

, yc'-j.i-Av-axj.y, to the compression 

of data generated on a digitizer tablet. 

10 When something is written on a digitizer tablet the 

output is a parametric representation of the writing 
trajectory; that is, the writing is represented as a series of 
X, y coordinate values as a function of time, a typical 
digitizer tablet samples the position of the writing pen 100 
...es per second, at fixed intervals of o.Ol seconds. The 
tablet continuously transmits the resulting data to the 
receiving computer. Thus every o.Ol seconds, the tablet 
samples the pen position and transmits an x-coordinate value a 

Whether the pen .s up or down. The data generated in this 
manner is commonly referred to as electronic ink. 

There are applications where it is of interest to 

^-^-^le to save an electronically generated signature for 
= .ater signature recognition. Another application is the 
permanent storage of data which has been generated on a 
digitizer tablet. 

stora„ """"" """"" ink is that the 

str. .e,u.re.ents can ,uite la.,e the original sa.plea 

rea.c..=n .echn.,ue for electronic ink that can achieve high 

aa.a compression and yet be efficient ro , . 

J ciiicieni. to compute. The 

^.ay used on computers with relatively r,odest CPU power 
' a °' '^"'^ ^--^^ -^^^ 3i,nificantly'sio:ed i. 

s-^;;! ,rr"'""'°" '"""'""^ ^° ^^^^^^ » each pen 
wh'Ich p : ' " '"^ continuous segment 

w Oh begins when the pen touches the digitizer tablet ■ = 
s-.ace ana ends when the pen is lifted fro. the surface , 
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There are a number of methods for data compression. 
The earliest methods were based on the idea of minimum 
redundancy coding. The basic idea was very simple; symbols 
which occur frequently are coded with fewer bits than those 
which occur more rarely. The earliest known method to apply 
this technique was the Shannon-Fano coding. A subsequent 
development was Huffman coding, which is similar to Shannon- 
Fano coding in many ways. Huffman's contribution was in 
devising a very clever and elegant way to construct the coding 
and decoding trees. In general, simple Huffman coding does not 
achieve the same degree of compression as more modern 
techniques, but processing and memory requirements are 
relatively modest. 

A further enhancement of Huffman coding is called 
adaptive Huffman coding. A basic problem with standard Huffman 
coding is that the decoding program must have access to the 
Huffman coding statistics. Hence, trying to increase 
compression with more comprehensive statistics is eventually 
self-defeating, since more and more coding statistics must also 
be saved, which works against the compression. Adaptive 
Huffman coding gets around this problem by adjusting the 
Huffman decoding tree on the fly, effectively gaining the 
advantages of higher order modeling with -o additional storage 
for statistics. 

Huffman coding requires that an integral number of 
bits be used to code symbols. Arithmetic coding improves 
compression by using a fractional number of bits per code. 
Although arithmetic coding will always provide as good or 
better compression than Huffman coding, its disadvantage is a 
significantly larger computational burden. 

Dictionary coding is another class of data 
co-pression. in general, these techniques create a dictionary 
of symbol sequences or combinations which are then accessed by 
indexing into the dictionary. These techniques have become the 
stand -d for compression on small computers because they 
ccmbi good compression with modest memory requirements. 
Examc.es of codes using these techniques which are applicable 
to MS-DOS include PK2IP, ARC, ARJ , and Lharc. 
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There are also compression techniques that apply 
primarily to specialized domains. An example is compression of 
two dimensional images using the discrete cosine transform and 
its many derivatives. 

A method and apparatus for compressing electronic ink 
which achieves high compression while requiring minimal memory 
and CPU power is needed. 

SUMMARY OF THE INVENTION 
According to the invention, for a computer with a 
digitizing tablet, a method and apparatus are provided for 
compressing data based on the characteristics of each 
individual stroke contour. 

The digitizer tablet samples the position of the 
writing pen a pre-deternined number of times per second and at 
fixed intervals. This data is continuously transmitted to a 
computer in the form of x and y coordinates plus an indicator ' 
of whether or not the pen is touching the surface of the 
tablet. 

In the preferred embodiment the incoming data from 
the digitizer tablet is preprocessed. This preprocessing, 
Which is application and/or system specific, is not part of the 
compression. Examples of preprocessing include smoothing for 
noise reduction, endpoint filtering, discarding duplicate data 
points, median filtering. Manning filtering, spatial sampling, 
and size normalization. 

In the first stage of compression a determination is 
made of which points along the stroke contour are essential to 
the reconstruction of a quality facsimile of the original 
writing from the sampled points. The method proceeds by 
calculating descriptors and finding essential features of the 
s-rcKe contour. in the preferred embodiment, the descriptors 
are local curvature, local extrema, and the endpoints of the 
stroke. Although the actual algorithm is somewhat detailed in 
general the idea is that the extrema are saved, the points ' 
aoout Which the local curvature is above a threshold are saved, 
and the endpoinrs of the stroke are saved. 
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In the second stage of compression any number of 
standard compression techniques can be applied including 
Huffman coding, adaptive Huffman coding, arithmetic coding, and 
a variety of dictionary compression techniques, m the 
preferred embodiment differences of the form Ax = x- - x- and 

= yi - Vi-i are calculated and the stage 2 comprLsiorL 
applied to the differences. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a flow chart of the general compression 
method according to the invention. 

Fig. 2 is a flow chart of the extrema determination 

step. 

Fig. 3 is a flow chart of the stroke endpoint 
15 determination step. 

Fig. 4 illustrates the local curvature calculations. 
Fig. 5 is a flow chart of the duplicate data points 
preprocessing step. 

Fig. 6 illustrates the endpoint filtering 
20 preprocessing step. 

Fig. 7 is a flow chart of the endpoint filtering 
preprocessing step. 

Fig. 3 is a flow chart of the median filtering 
preprocessing step. 

Fig. 9 is a flow chart of the Banning filtering 
preprocessing step. 

Figs. lOA-C illustrate the reconstruction of an 
electronically written signature without applying this 
invention's compression techniques. 

Figs.. iiA-c illustrate the reconstruction of an 
electronically written signature after applying this 
invention's compression techniques. 

Figs. 12A-C illustrate the reconstruction of a 
sequence of lower case letters after applying this invention's 
compression technique. 

Figs. 13A-C illustrate the reconstruction of a simple 
.me after applying this i.nvention's compression technique. 
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Figs. 14A-E illustrate the trade off between 
compression ratio and reconstruction fidelity. 

Fig. 15 illustrate the apparatus to accomplish the 
disclosed method. 

DESCRIPTION OF THE SPECIFIC EMBODIMENT ( S ) 
The flow chart of Fig. l illustrates the compression 
method described by this invention. Data is input by writing 
on a digitizer tablet (step 100). The tablet represents the 
writing as a series of x and y coordinate values as a function 
of time (step iio) . This stream of x,y position coordinates 
along with a pen up/pen down indicator for each coordinate pair 
is continuously sent to the computer (step 120) . 

In the preferred embodiment, prior to data 
compression, the data is preprocsssed (step 130). The exact 
form of preprocessing will vary according to the application 
and system. Preprocessing is generally used to remove noise. 
The details are unimportant as long as a good representation of 
the written input data is maintained. Another optional step 
which is dependent upon the application and system is the 
removal of duplicate or close points (step 140) . This step is 
generally necessary when the digitizer tablet's user writes 
slowly. Slow writing causes oversampling resulting in duplicate 
or close points which are redundant and do not carry 
significant information. 

The first stage of compression is based on the 
characteristics of each individual stroke contour (step 150). 
In the second stage of compression any of the standard 
mathematical compression techniques can be applied (step 160) . 
Although this stage of compression is not required by the 
invention, it is employed in the preferred embodiment as a 
porential means of further compressing the data. 

After the completion of the second stage, a 
comparison is made between the first stage and the second stage 
compression (step 170). If the compression was not improved 
after applying stage two, then the stage one compression data 
IS saved (step 180). if the data was further compressed by 
stage two, then this compression data is saved (step 190) . 
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'The computer program modules used to compress the 
electronxc data are given in the appendix. Significant program 
modules and their specific functions are as follows: 

Stage one compression: 

=alc_sav.c Function to detennine which points in the 

signature need to be saved, based on various 
criteria. 

curvature. c Function to calculate the local curvature at a 
point along the contour. 

Stage two compression: 

cmpsig2.c Function to compress an ar-ra,, «^ *. 

v-wii.ijj.e5s an array of stage 1 data 

with characteristics of the signature 

verification template data. 

pack_half bytes, c 

Function to pack the value to be compressed into- 
half bytes. 

set_4_bits Function to pack values into the upper or 

lower 4 bits of a byte. 
get_4_bits Function to unpack values from the upper or 

lower 4 bits of a byte 
uncmpsig2.c Function to uncompress "an array of data 

compressed by the function cmpsig2 c 
unpack_hal f bytes, c 

Function to unpack and reconstruct the value 
from half bytes. 

Figs. 2-4 are detailed illustrations of the staae one 
compression methods. Fig 2 show«; « fi . 

fig. 2 shows a flow chart of the first 

The d " ^^^^^"^ determined and saved 

ue "riV!" - ^ - coordinate 

alues Xi and y • , as a function of time (step 210). The 

n L::: rb" '^'^^^ ^ ^^-^^ — 

ini.xalized by settx.ng i equal to 1 (steo 220). 

Minima and maxima are determined by comparing, the 
va ue Of , ,,,,, ^^^^^^ P . 

a^rectly adjacent to it (step 230). To be a maxima, the value 
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'*of either the x coordinate or the y coordinate must be larger 
then the values of the corresponding coordinate of the data 
points on both sides of the data point under review (step 230) . 
To be a minima, the value of either the x coordinate or the y 
coordinate must be smaller then the values of the corresponding 
coordinate of the data points on both sides of the data point 
under review (step 230) . 

If any of the four relationships defined by step 230 
are met (step 240), then that data point will be saved (step 
2 50), otherwise the data point is not saved. After determining 
whether or not to save a data point, the value of i is 
increased by one (step 2 60) and a determination is made as to 
whether or not all data points have been reviewed (step 270) . 
If all data points have been reviewed then this subroutine ends 
(srep 280), otherwise the subroutine loops back to the 
comparison step (step 230), continuing until all data points 
have been reviewed. 

Fig. 3 shows a flow chart of the second step of 
compression in which stroke endpoints are determined and saved. 
The digitizer data is in the form of a sequence of coordinate 
values, and y^, as a function of time (step 310). The 
subscript i varies from i=o to i=nsamps-l where nsamps is the 
total number of data points in a stroke. The subroutine is 
initialized by setting i equal to 0 (step 315). 

This subroutine first determines whether or not the x 
and y coordinates of the data point indicate that it is the 
first data point of the stroke (step 320). If it is the first 
data point, then it is saved (step 330). Next the subroutine 
deterirdnes whether or not the x and y coordinates of the data 
poinr indicate zhat it is the last data point of the stroke 
(srep 340). If it is the last data point of the stroke, then 
it is saved (step 350) . After determining whether or not to 
save the data point, a comparison is made to determine if all 
data points have been tested (step 360). If they have been 
then this subroutine ends (step 370), otherwise the value of i 
is increased by one (step 380) and endpoint determination 
continues . 
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^^9. 4 illustrates the calculations used to determine 
the curvature at each data point between the stroke endpoints. 
The stroke contour is represented by line 410 while the 
individual digitizer data points are represented by points 420. 
Point 430 is data point i, the data point of interest in this 
Illustration. The coordinates of 430 are and y^. 

In Fig. 4 line 440 is defined as the line segment 
connecting point 430 with point 425. The coordinates of point 
425 are x^.^ and y^_^, where d is equivalent to 3. Line 450 is 
defined as the line segment connecting point 430 to pbint 435 
Where point 435 has coordinates Xi^^ and y^^^. curvature angle 
460 is the angle defined by lines 440 and 450. 

During stage one compression, the curvature angle for 
each data point is calculated, if the calculated curvature 
angle is within a certain range of angles, the data point is 
saved. If the angle does not fall within the pre-selected 
range of angles, the data point is discarded. The choice of 
the variable "d" as well as the range of threshold angles is 
determined by the application's requirements. In the preferred 
embodiment, the value of d is 2 ; data points which have a 
curvature angle between 5 degrees and 25 degrees are retained 
for applications requiring high fidelity, low compression 
(e.g., signature verification); and data points which have a 
curvature angle between 25 and 50 degrees are retained for 
applications requiring low fidelity, high compression (e.g., 
simple data storage) . 

Figs. 5-8 are detailed illustrations of optional data 
preprocessing methods. Fig. 5 is a flow chart of the method by 
Which duplicate data points are discarded. The digitizer data 
IS m the form of a sequence of coordinate values, x. and y- 
as a function of time (step 510). The subscript i varies from 
1-0 to i=nsamps-l where nsamps is the total number of data 
pomrs in a stroke. The subroutine is initialized by setting ^ 
equal to 1, x^^^ equal to Xq and y^^^ equal to y^ (step 520). 

The current data point, represented by x. and y. is 
compared to the previous data point, represented by x 'and 
Vsav (Step 530). If the values for the x and y coordinates are 
Identical, then the current point is discarded (step 540) and 
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"the next point is compared to X and v 
continues until either all of ^^^^"^ 

reviewed rsten 5.0/ '''^^ ^^^^ 

lewed (Step 560) or until x^ and y. do not equal x and 

ysav respectively, when the two ri^fl ■ 

5 equivalent, x is s.t . "° ^°"9er 

^sav set equal to x. and y is =01- ^ 
(Step 545). The value of i is th.^ • ^^"^ ^° 
«nri increased by one fsten c;Rn^ 

10 (step 570). subroutine ends 

writer's unsureness rasultL "^"^ t° the 

13 contact, or due to pen s" !^"' " °' "'^ 

preprocessing .e^JVTnZZ. /nt ' "^"""'^^ 
remove these artifacts. '^"er.ng which can be used to 

points ..o""c™r "^'^ °^ ' ''"^ 

point .35, and cli:, „ T '"^ 
point s«, are use! o L: tit" T 
either discarded or saved L the f 
Physical radius o. this circ ITslTlTTs 'T'^'"'' 
^.onher of data points contained with n , = 
oependent upon the resolution of the di.it Ler taMet" 
resolution of a typical tablet is 10 Ji.l 
Therefore the threshold t 

hy .uitipi.in, the des L ; ;:::ard::::" " ^'"-^^ 

the tablet, resolution Uno^ts^":"!:::;: 

Preproces^, Jetho;. i^r rtaT""' 

sequence of coordinate values 

(=tep 710,. The subscript . ;arL. ^ ' °' 

Where nsanps is the totaf " 7 ^-""■"Ps-l 
-e subrouLe ^1:^:^^ T"" ' 

'15. and setting , e,ual to i (step""' '"^^ " ° 
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is less than t, 
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"indicating that this point is Within Circle 630 th. 
corresponding to x • v = ^ • =ircie 63 0, the point 

f J-ny to Xj , y. IS discarded rsten 

3 is then increased by one fsten ,Tn^ ! " "^'"^ °' 

(Step 730, i. repeatel Jil celculaticn for d 

^ less then the ..Ls" , ,1 T^Lrt""""" ^ 

is saved as „e second det" poin^ s.eT^"" IT " 
=™p:etes the subroutine .or endpoint fut n '.or 
beginning of a stroke. ^ 

The endpoint filtering method used for th. , . 
" Of a stroke is identical to that used for tie first 
e«ept that to initialise the subroutine i is LT 
nsa„ps-l (steo 715) and i is „^ , *° 

Median filterin, \s '° """'^'^ • 

preferred e^oir/n^L redu'" r"?' """" 
15 glitches that result <n suddl^ " °' 

aata strea.. ri,. 3 l^T " «-»tinuities in the 

-he digitizer data s iVth L:^"' illustrating this method, 
values, and y. as a fu . ° °' coordinate 

subscript^ varu^ rom i-"\ °' ""^ '"'^ ' 
" total number of data p^i^s 

initialised by settL:"";:: l:TZ.. .^V"^-""- ^ 

cocrdinate"i;«ie™rsr oTT' T ^""^ 
successive data points ^^'"^ °^ ">ree 

" tnen increased by one ' e^s ^ ™^ ^^^^ " ^ 

until the value of i is . subroutine continues 

-nus two <ste : 0 Zll llTlT^' " 

(step 870) . '""^ '''^ subroutine is complete 

A flow chart of the i 
- the preferred embodiment is UlustrTter""? " 

::i:::"r::: y" vv- ^ ~^o1-c:;.::l 

subscript^ vaJi; p 1":":™ " ""^ '"^ ' 
total number cf data points in a s"tTr"' 
^= Initialised by setting i ™" subroutine is 

by setting 1 equal to 1 (step 920). 

In this method, a linear r,— . 
noise and smooth the stroke r T " '° ""^"o- 

y coordinate ircalcuiL . . " ^^'"^ "^'^ 

calculated based on the initial values of 
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"three successive data points (step 930) Th. v 1 " ' 

then increased by one t<.t.r. « ^^"^ °^ ^ is 

until the value Of ii.' ^ subroutine continues 

.inus two (Step sso) at 2Tl T """^^^ °^ 

^ (Step 960) . 

« time the subroutine is complete 

reconstruction fidelity, i. "'"^ 

Signature based upon the clrll.l """"ructed 

required only 219 bytes „rst ^"'^ 

original. The staoe o " '° """""^ °^ the 

^0 ss, bytes, .ich"::: :::th~:: iti:t:t ^^^^ - 

"age two compression technique. '^"'^'"^ = 

letters. 1^ in"V^';3 Tif " 
1-0 received .ro„ the li,ZLll\lT. 
5 determined to be critical t^ .K 

- is the reconstrut::; ^^j:^'::::^^ ^'''- 

original required 2072 bytes th. ^ ' 

^-ge one compression on^quted""^^ 

301 ^ytes as percent? ""^"-^0" «s applied required 

aotomaticany- aZts't" t."" T 

little structure. ,3B sh " t r,:?" r""^"^ 

received from the dir,i^,- • actual points 1310 

to be critical to t^e r o^nl""" ^"^^^--^ 
reconstruction 1 .!!!°""^""^°" "^""y. 



?ig. 

The 



^=~tion based upon circled Z^l^'J^^lX 
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" stage one compression has been applied. The reconstruction 
required only 6 bytes and represented a compression to 2 
percent of the original. In this case stage two compression 
was not applied because it increased the number of bytes to 14. 

Figs. 14A-E illustrates the trade between compression 
ratio and the fidelity of the reconstruction. Fig. 14A is the 
original of a printed address which required a total of 1808 
bytes. Fig. 14B shows the first level of compression which 
employed a compression ratio similar to that used in Figs nc 
and 12C. After both stage one and stage two compression, this 
reconstruction is 13 percent of the original. Figs. 14C-E show 
gradually increasing levels of compression. The reconstruction 
Shown in Fig. 14C, requiring only 9 percent of the original 
appears quite similar to the original. Figs. 14D-E, although 
quite readable, show definite differences from the original 
The reconstructions in Figs. 14D and 14E require 8 percent and 
7 percent of the original storage space, respectively 
Although some applications require high fidelity, such as 
signature verification, many applications could tolerate the 
lower fidelity accompanying the high compression ratios shown 
in Figs. 14D-E. 

Fig. 15 shows an embodiment of the apparatus to 
acccmpiish the method described in this invention, m the 
prererred embodiment, the computer 1550 is an IBM/PC/AT 
co.,patible with an IBM compatible monitor 1540 and keyboard 
1510. computer 1550 employs the methods disclosed by this 
invention to compress the electronic ink. Handwritten 
information is input either directly via the hardwired 
digitizer tablet 1520 or via a portable digitizer tablet 1530 
Portable digitizer tablets are in common use in many 
moustries, including the package delivery industry where an 
electrcnic signature is obtained upon delivery of a package. 
Depending upon the computing power of tablet 1530, the 
signature can either be immediately compressed using this 
invention's compression methods or temporarily stored for 
sunsequent downloading to computer 1550. Once the handwritten 
aata has been compressed, it can either be stored in comouter 
1^=0 or in separate memory 1560. Monitor 1540 allows the 
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'* original handwritten data to be .compared with the compressed 
data. 

The invention has now been explained with reference to 
specific embodiments. Other embodiments will be apparent to 
those of ordinary skill in this art in light of this 
disclosure, it is therefore not intended that this invention 
be limited except as indicated by the appended claims. 
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APPENDIX 

The computer program modules used to compress the 
electronic data are given in this appendix. Significant 
program modules and their specific functions are as follows: 

Stage one compression: 

calc^sav.c Function to determine which points in the 

signature need to be saved, based on various 
criteria. 

curvature. c Function to calculate the local curvature at a 
point along the contour. 

Stage rwo compression: 

cmpsig2.c Function to compress an array of stage 1 data 

with characteristics of the signature 
verification template data. 

pack_half bytes, c 

Function to pack the value to be compressed into 
half bytes. 

set_4_bits Function to pack values into the upper or 

lower 4 bits of a byte. 
get_4_bits Function to unpack values from rhe upper or 

lower 4 bits of a byte. 
uncmpsig2.c Function to uncompress an array of data 

compressed by the function cmpsig2.c. 
unpack_hal f by tes . c 

Function to unpack and reconstruct the value 
from half bytes. 



/* CALC_SAV.C 
★ - . 



* Function to deterniine which points in the signature need to 
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The results are 



* be saved, based on various criteria. 

* returned in array save_ptsc. 

* For example, if the ith point of x[],y[] should be saved 

* then save_ptsC[i] = i, else save_pts[i] = O; 

* cur_coinp returns: i M.e yes ny tottttx 

li.e. iti, or. TRUE) for successful 

scaling 

* 0 (i.e. NO or FALSE) if there is an 

error 

* 

* input arguments: curvature_threshold, nsamps, numsegs, 

* segpnt, x, y 

* 
*. 



output arguments: save_pts 



*/ 

sinclude <stdio.h> 
=include <cic.h> 

BOOL calc_sav ( curvature_thr eshold , 
y, save_ptsc) 
COUNT curvature_threshcld; 



nsamps, numsegs, segpnt, x, 
/* threshold for curvature*/ 



COUNT nsamps; 



COUNT numsegs; 

COUNT segpnt [ ] ; 

COUNT x [ ] ; 

COUNT y [ ] ; 

TINY save_ptsC[]; 



COUNT i; 
COUNT k; 



/* compression 
/* number of samples 
/* (length) in input 
/* arrays x and y 
/* number of individual 
/* segments (strokes) 
/* pointers to beg of 
/* individual strokes 
/* input array of x 
/* values 

/* input array of y 
/* values 

/* contains information 
/* about what points to 
/* keep (returned to 
/* calling program) 
/* loop counter 
/* loop counter 



*/ 

V 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 

*/ 
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10 



20 



30 



— -^wv* ±. 




/* 


temporary variable 


*/ 


COUNT 


value2 ; 


/ 


temporary vanaole 


*/ 


COUNT 




/* 


temporary variable 


*/ 


COUNT 


lower limit; 


/* 
/X 


defines starting pt 


*/ 






/* 


for curvat compress 


*/ 


COUNT 




/* 


defines ending pt for 


*/ 






/* 


curvat compress 


*/ 


BOOL 


1 a +• 11 / \ • 


/* 


function to calculate 


V 






/* 


local curvature 


*/ 


COUNT 


^ ^ C Clil^XCf 


/* 


calculated curvature 


*/ 






/* 


at a point 


*/ 


BOOL 


C \ OC)CLJ 1 CO R • 

^ ^ w w Jv W X O C O f 


/* 


YES If clockwise, NO 


V 






/* 


otherwise 


V 


BOOL 


calc_dir ; 


/* 


if YES calculate 


*/ 






/* 


dir_angleP 


*/ 


COUNT 


dir_angle; 


/* 


direction of curvature 


*/ 






/* 


bisector 


*/ 



Initialize the save^ptsC array, 



for (i = 0; i < nsamps; *+i) 



{ 



save_ptsC[i] ~ l; 



Perform curvature based compression (determines which points 
to keep of the stroke and saves the information in array 
save_ptsC to return to the calling program). 



:or (i = 0; 



i < numsegs ; ++i) 



lower_limit = segpnt[i]; 
upper_limit = segpnt[i + 1] ; 
while (lower_limit < upper_limit) 
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checks = NO; 

for (k = lower_limit +2; k < upper_liinit ; k += 2) 

( 

curvature (X, (k + lower^limit) / 2, 
nsainps, (k - lower^limit) / 2, 
fitsize^angle, &clockwiseB, &calc_dir , 
&dir_angle) ; 
if (size_angle > curvature_threshold) 
{ 

checks = YES; 
break; 

) 

) 

if (checkB) 

( 

value2 = (k + lower^limit) / 2; 
valuel = value2; 

} 

else 

{ 

valuel = segpnt[i +1] - l; 

value2 = (valuel lower_liinit) / 2; 

) 

for (k = iower^limit ^ l; k < valuel; --rk) 

if (k != value2) 

save_ptsC[k] = O; 

j 

if (CheckB) 

lower_liinit = value2; 

else 

lower_limit = upper^limit ; 

} 



•k - 



Return ro calling program with successful completion flag. 
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*/ 

return (YES); 

/* 

* 

*/ 

^ /* end of function */ 

/* CALC_SAV */ 

/* CURVATURE. C 

* Calculates the curvature, direction of curvature (angle) 

* bisector, and whether the change in angle is clockwise or 

* counter_clockwise at a point defined by function argument 

* "index" into the arrays of coordinates poinrs y[], 

* representing the stroke contour. The quantities returned 

* are in terms of the angle range 0-3 60 degrees. 



* 



* If ERROR is detected th£i BOOL function value is YES (=i) , 

* while for a normal return the BOOL function value is NO 

* (=0). 



★ 

* CAUTION 



when the curvature is 0 (i.e., calculated 
size^angleP = o or close to 0), the calculation of the 

* direction of the curvature bisector is, of course, 

* inherently ambiguous and unreliable. 
* — - . ^ 

* / 

#inciude <stdio.h> 
^include <cic.h> 

3C0L curvature(x, y, index, maxpts, Ighseg, size.angleP, 
clockviseBP, calc_dirB, dir_angleP) 
COUNT x[j; ^ coordinate values of */ 

/* character " ★/ 
COUNT y[j; y coordinate values of */ 

/* character */ 
COUNT index; p^^t in x, y to calc. */ 

/* curvature */ 
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/* 

* - 



COUNT maxpts; 



COUNT Ighseg; 



COUNT *si2e_angleP; 



BOOL *clockwiseBP; 



BOOL calc_dirB; 



COUNT *dir_angleP; 



COUNT plus^point; 
COUNT minus^point ; 
COUNT plus__angle; 

COUNT minus^angle; 

COUNT dif^angle; 

COUNT cic_atan2 0 ; 
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*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 



/* maximuin length of 
/* arrays x,y 
/* length of segment for 
/* curvatur calc 
/* calculated curvature 
/* at point index 
/* YES if clockwise, NO 
/* otherwise 
/* if YES calculate 
/* dir_angleP, otherwise 
/* otherwise return 
/* dir^angle = o 
/* direction of curvature */ 
/* bisector 

/* stores "index -r- Ighseg"*/ 
/* stores "index - Ighseg"*/ 
/* stores calc direction */ 
/* of upper seg, 
/* stores calc direction 
/* of lower seg. 
/* stores angle angle 
/* difference 
/* function to calculate 
/* angle in the 
/* range -1800 to 1800 
/* (degrees x 10) 



*/ 
*/ 
*/ 
*/ 



*/ 
*/ 
*/ 
*/ 



* Check for errors in specifying the range for curvature 
computation. 



J3 



V 



rainus_point = index - Ighseg; 
plus_point = index ^ Ighseg; 
if (ininus_point < o'|i pius_point >= maxpts) 



( 



*si2e_angleP = O; 
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*clockwiseBP = YES; 
*dir_angleP = 0; 
return (YES) ; 



) 



/* ERROR - out of bounds, */ 
/* cannot calc. curvature */ 



/* 



* calculation of the local curvature (i.e., angle change). 

* Note: the function cic_atan2 returns angle values in the 

* range -1800 to 1800, so it is necessary to divide by lo to 

* get the desired curvatures. 



plus_angle = cic_atan2 (x [plus_point 1 - >c[indexl, 

y[plus_point J - y[index]); 
minus_angle = cic_atan2 (x [ index] - x[ininus_point] , 

y[ index] - y [minus_point ] ) 
dif_angle = plus_angle - minus_angle; 
if (dif_angie > 1800) 

dif_angle -= 3 600; 
else if (dif_angle < -1800) 

dif_angle += 3 600; 
if (dif_angle < 0) 

( 

*si2e_angleP = (-dif_angle ^ 5) / lO; 
*clockwiseBP = YES; /* by convention 

/* clockwise 
/* angle changes are 
/* negative 



) 



else 



/* 



( 

*si2e_angleP = (dif_angle + 5) / lO; 
*clockwiseBP = NO; /* by convention 

/* counter-clockwise 
' /* angle changes are 

/* positive 



*/ 
*/ 
*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
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* Calculate the direction of the bisector of the local 

* curvature (i.-e., angle change) computed above in the range 

* 0-360 degrees. Note: the cic_atan2 returns angle values in 

* the range -1800 to 1800, so an adjustment is made in the 

* code below so that the resulting calculated bisector value 

* is in the range 0-360. 
» 

*/ 

if (calc_dirB) 

{ 

if ( *clockwiseBP == YES) 

*dir_angleP = plus_angle - (900 + (dif_angle - i) 
/ 2) ; 

else 

*dir_angleP = plus_angle + (900 - (dif_angle + i) 
/ 2) ; 

if (*dir_angleP < o) 

*dir_angleP += 3 600; 
*dir_angleP = (*dir_anglep +5) / lO; 
if (*dir_angleP >= 360) 

*dir_angleP -= 3 60; 

) 

return(NO); /* 

^ ' /* normal return - no */ 

/* errors detected */ 



) 

/* 

3 0 CMPSIG2.C 



/* end of function */ 
/* CURVATURE. C */ 



* Function to compress an array of data with charcterist . 

* the signature verification template data. The companion 

* program to uncompress the data is UNCMPSIG2.C. NOTE 



ICS of 



35 * compression/uncompression is exact (not lossy) and 
* restores the original data exactly. 
* 



* 



cmpsig2 returns: i (i.e. YES or TRUE) for successful 
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* compression 

* 0 (i.e. NO or FALSE) if there is an 

error 

* 

* input arguments: 

* numin - number of elements (length) of data array to 

be compressed 

array - data array to be compressed 

maxout - maximum allowed size of the compressed array 
comp_arrayUG[ ] (to prevent overwriting 
memory) 



* 
* 

10 * 



* ourput arguments: 

* num_comp_bytesP - ptr to # elements (each a byte) of 

* the compressed data stored in 

comp_arrayUC[ ] . 

comp_arrayUC - unsigned byte array containing the 
compressed representation of the input 

* data in array [ ] ; 

20 * 

^include <stdio.h> 
^include <cic.h> 



★ 
* 
★ 



25 300L cmpsig2 (numin, array, maxout, num_comp_bytesP, 

comp_arrayUC) 
COUNT numin; 



30 



35 



COUNT array [ ] ; 
COUNT maxout; 



/* number of points in input */ 
/* array 

/* array containing input */ 

/* data to be compressed */ 

/* max number of pts for */ 

/* comp_arrayUC */ 

COUNT *num_comp_bytesP; /* actual #bytes used in V 

/* corap_arrayUC */ 

/* output array of compressed */ 

/* data */ 



TBITS conp_arrayUCr 1 ; 



COUNT i; 



/* loop index 



V 
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COUNT nuin_sign_change; /* counts number of sign */ 

/* changes */ 
COUNT value; /* for general use temporary */ 

/* storage */ 

COUNT byte_part; /* i if upper 4 bits of byte, */ 

/* and 2 if lower 4 bits of */ 

/* byte */ 
TBITS *comp_arrayUCP? /* ptr to coinp_arrayUC of sig.*/ 

/* data */ 

TBITS byte_maskUC; /* byte used to temporarily */ 

/* hold compressed data */ 
COUNT last_index; /* saves last index of direct.*/ 

/* change */ 

COUNT current_sign? /* used in sign change */ 

/* determination */ 

COUNT last_sign; /* used in sign change */ 

/* determination */ 

BOOL set_4_bits() ; /* function to pack values */ 

/* into the upper or lower */ 

/* 4 bits of a byte */ 

BOOL get_4_bits() ; /* function to unpack values */ 

/* from the upper or lower */ 

/* 4 bits of a byte */ 

BOOL pack_hal f bytes 0 ; /* implements main packing */ 

/* logic */ 

/* 

★ ^ 

* initializations for the compression process. 
-k — 

V 

*nuin_comp_bytesP = o ; 
*corap_arrayUCP = comp_arrayUC ; 

/* 

•k 

* First, save the number of items in the array to be 

* compressed. This value is stored as two bytes. The first 

* byte is the upper 8 bits of the 16 bit COUNT value for 

* numin, and the second byte is the lower 8 bits. 



wo 94/03853 



PCT/US93/06883 



24 



*/ 

*nuin_coinp_bytesP += 1; 

*coinp_arrayUCP++ = (TBITS) ((nuinin >> 8) & Oxf f ) ; 

*nuin_coinp_bytesP += 1; 

*coinp_arrayUCP++ = (TBITS) (numin & Oxff) ; 

/* 

« 

* Find the number of sign changes (i.e., when successive 

* difference values have different signs) and save to 

* coinp_arrayUC. Note that the maximum number of sign changes 

* allowed is 255. The function immediately exits and returns 

* an error flag if > 255 sign changes are detected. 

15 V 

nu!Ti_sign_change = 0? 

last_sign = array[l] - array[0]; 

for (i = 2; i < nuinin; ++i) 



10 



20 



30 



35 



\ 



I I 

] I 



current_sign = array[i] - array[i - i); 
if ( (current^sign < 0 last_sign >= 0) 

(current_sign >= o && last_sign < O) ) 

{ 

last_sign = current_sign ; 
^ ^ -^+nuin_s ign_change ; 



) 

if (nuiT:i_sign_change < 256) 

{ 

*nuin_cor\p_bytesP i; 

*coinp_arrayUCP++ = (TBITS) nuin_sign_change ; 



else 

! 

fprintf (stderr, .:'\n\nCMPSIG ~ nuin_sign_change > 

256\n\n") ; 
return (NO) ; 
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/* 

* Initializations for compression 

* . . . ^ 

V 

byte_part = 1; 
byte^maskUC = 0; 

/* 

* — - . ^ 

* ^ack the sign of the first value of the input data (i.e., of 

* array [0]) 

* ^ 

if (array[01 >= 0) 
value =1; 

else 

value = 0; 

pack_half bytes (value, &byte_part, &byte_inaskUC, 
Stcomp^arrayUCF, nuin_coinp_bytesP) ; 

/* 

*" — — « . ^ 

* Pack the sign of the first difference value (i.e., of 

* array [1] - array [0]) 

X ^ 

V 

last_sign = array[l] - array[0]; 
if (last_sign < 0) 
value = 0; 

else 

value = ■ 1 ; 

pack_halfbytes( value, &byte_part, &byte_inaskUC, 
&coTnp_arrayUCP, nuin_comp_bytesP) ; 



/ 



★ 



* Pack the sign change positions (where successive difference 

* values have different sign) . 
★ 
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last_index = O; 

for (i = 2; i < numin; ++i) 

{ 

current_sign = array [i] - array [i - i]; 
if ( (current_sign < 0 && last_sign >= o) j| 
(current_sign >= o && last_sign < o) ) 



last_sign = current_sign; 
if ((i - last_index) > 255) 

{ 

fprintf (stderr, "\n\nCMPSlG — sign index- 
change > 255\n\n") ; 
return (NO) ; 

I 

pack_half bytes (i - last_index, &byte_part, 
&byte_niaskUC, &coinp_arrayUCP, 
num_coinp_bytesP) ; 

last_index = i; 

) 

/* end of loop to pack sign */ 
/* change position values */ 



l!!!!'''"^"""^ ^^^""^^ ""^^^^ ^^^^ ^° comp.arrayuc 

V 

for (i = 0; i < numin; ++i) 
{ 

if (i == 0) 

value = abs ( array [ 0] ) ; 

else 

value = abs (array [i] - array [i - i]); 
pack_half bytes (value, &byte_part, &byte_naskUC, 

&comp_arrayUCP, num_conip_bytesP) ; 
if (*num'_conp_bytesP >= maxout) 

{ 

fprintf (stderr, "\n\nCMPSIG - comp_arrayUC too 
large\n\n") ; 
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return (NO) 

) 



/* 



/* end of loop to pack */ 
/* difference values */ 



* Finally, if the data ends on an odd number of half bytes, 

* must save one final byte because the last signature data 

* value is in the first half byte of byte_inaskUC and hasn't 

* yet been saved to comp^arrayUC. 

•k ^ 

*/ 

if (byte_part ==2) 
{ 

set_4_bits( (UTINY) 15, 2, &byte_maskUC) ; 
*num_coinp_bytesP += 1; 
if (*nuin_conip_bytesP > maxout) 
{ 

fprintf (stderr, "\n\nCMPSIG — coinp_arrayUC too 

large\n\n"; ; 
return (NO) ; 
) 

*conip__arrayUCP++ = byte_niaskUC; 

) 



/* 



* If this point is Reached the compression process has 

* proceeded normally and no errors have been detected, so 

* return with a "successful completion" status flag. 

* — ^ ^ 

^ / 

return ( YES ) ; 
* 

*/ 

' /* end of function CMPSIG2 

/* PACK_HALFBYTES . C 



* Function to pack a COUNT value into half bytes. 
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* 
* 
* 

*/ 



pack_half bytes returns: i (i.e. YES or TRUE) for 

successful packing 
0 (i.e. NO or FALSE) if 
there is an error 



BOOL pack_half bytes (value, byte_partP, byte.maskUCP, 
coinp_arrayUCPP, nuin_coinp_bytesP) 

COUNT value; /* value to be compressed */ 
COUNT *byte_partP; /* i if ^pp^^ , ^^^^ ^^^^^ 

/* and 2 if lower 4 bits of */ 
/* byte 

TBITS *byte_inaskUCP; /* packed byte representing */ 

/* different values */ 

TBITS **coraD arrayuCPP; /* di-t- i-n r^-h-^ 

K^y-rc, / ptr to ptr to comp_arrayUC */ 

/* element *^ 

COUNT *num_comp_bytesP; /* actual #bytes used in */ 

/* comp_arrayUC */ 

{ ^ 

UTINY tempuc; /* temporary value */ 
UTINY continuation_labelUC;/* used to save continuation*/ 

/* label 

BOOL get_4_bits(); /* function to unpack values */ 

/* from the upper or lower 4 */ 

/* bits of a byte ★/ 

BOOL set_4_bits(); /* function to pack values */ 

/* into the upper or lower 4 */ 

/* bits of a byte */ 

if (value < 14) 



* Pack 4 bit value into byte, when the absolute difference 

* value is < 14, then it can be packed into a half bvte (4 

* bits) . . _ - V 



V 
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set_4_bits((UTlNY) value, *byte nartP h, . 

if (*byte_partP == 2) ^^^-P^^-^P' byte.maskUCP) ; 

{ 

*byte_partP = 2.. 



*nura_coinp_bytesP += i; 
**coinp_arrayUCPP = *byte_n,askUCP; 
++ ( *corap_arrayucpP) ; 

) 

else 

*byte_partP = 2; 

) 

else 

{ 

/* 

Pack 4 bit continuation lahoi .-^4- ^ 

* ai„eren=e value <! 3 'aL " 

■^1-' ^ana, of course* a 

Prevxous block above deals with values I^^" It 

* value 14 into the next half byte an! '-. " 

* into a subsequent half byte If ;he . ^'"''"'^^ 

* value is >= 30 absolute difference 

30, pack 15 into the first h«if v, ^ 

* the re.ai.der into another byte If th ' ^ J 

* value is > 14 + 255 =. 26» absolute difference 

* the function i^eaiatelTe.rtTa:: IZlTlr^"'' 

* condition. returns an error 



if (value > 269) 
{ 

fprintf (stderr, "\n\nCMPSlG - abs dif 



value out of 



fprintf (stderr "-^a^^^ / 

' ^^"^^ ^= %d)\n\n", value)- 
return (NO); ^ °J-ue; , 

J /* return on error detection */ 

else if (value < 30) 

continuation_labelUC = (UTINY) 14- 
else / 

continuation_labelUC = (UTINY) 15; 
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/* 



* First, set half byte for continuation label 
* — 

V 



^®^-4«^its(continuation_labelUC, *byte _partP, 

byte^maskUCP) ; 
if (*byte_partP == 2) 

{ 

^0 *byte_partP = i; 

*nuin_coinp_bytesP += l; 
**coTnp_arrayUCPP = *byte_maskUCP; 
-^-+-(*coinp_arrayUCPP) ; 

) 

^5 else 

*byte_partP := 2 ; 
if (value < 30) 

( 

/* 

20 * 

* Pack a value corresponding to (value - 14) into a second 

* half byte (i.e., a value from 14 to 29 will be converted to 

* a value from 0 to 15 for packing) . 



30 



set_^4_bits((UTINy) (value - 14), *byte_partP, 

byte_maskUCP) ; 
if (*byte_partP == 2) 

( 

*byte_partP = i; 

*num__comp_bytesP -^^ i; 
**comp_arrayUCPP = *byte_maskUCP; 
++(*comp_arrayUCPP) ; 
) 

else 

*byte_partP = 2 ; 

) 

else 
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/* 



* If the original absolute difference value is >= 30 then 

* must pack two more half bytes in additional to the' 

* continuation half byte. 



if (*byte_partP == i) 

{ 

*byte_inaskUCP = (TBITS) (value - 14); 
*nuin_conip_bytesP += i; 

**comp_arrayUCPP = *byte_inaskUCP; 
++(*coiiip_arrayUCPP) ; 

) 

else 
{ 

get_4_bits(&teinpUC, i, (TBITS) (value - 14)). 
set_4_bits(tempUC, 2, byte_inaskUCP) ; 
*num_coinp_bytesP += i; 
**coinp_arrayUCPP = *byte_it,askUCP; 
-t-+(*comp_arrayUCPP) ; 

get_4_bits(&tempUC, 2, (TBITS) (value - 14))- 
set_4_bits((UTINY) tempUC, 1, byte_r.askUCP) ; ' 



) 



) 

return (YES) ; * ,,,, 
^ ^^^ue unpacked successfully*/ 

/* UNCHPSIG2.C °' Pack.half bytes */ 



.-nc.on to uncompress an array of data compressed by 
.unct.on CMPSIC2. NOTE: compression/uncompression is exact 
(not lossy) and restores the original data exactly 



* 



uncmDsig2 returns* 1 ^ 

y returns. i (i.e. YES or TRUE) for 

successful uncompress 
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0 (i.e. NO or FALSE) if 

* there is an error 
* 

* input arguments: 

5 * comp^arrayUC - unsigned byte array containing the 

compressed representation of the data 

* (as created by CMPSIG2.C) 

* output arguments: 

10 * numoutP - ptr to number of elements (length) of the 

* uncompressed data 

array - data array created by unpacking and 

reconstructing values taken from the 

compressed array comp_arrayUCr 1 . 
15 ★ 

V 

#include <stdio.h> 
#include <cic.h> 



20 



25 



30 



35 



BOOL uncinpsig2(numoutP, array, comp^arrayUC) 



COUNT *numoutP; 



COUNT array [ ] ; 



/* number of points in output */ 



TBITS comp_arrayUC[ ] 



COUNT i; 

COUNT kount^bytes; 



/* array 

/* array for output signat 

/* data that has reconst from 

/* comp^arrayUC by reversing 

/* the compression proc. 

/* compressed input array of 

/* sig data 

/* loop index 
/* kounts bytes used in 
/* comp_array 



*/ 
*/ 
*/ 
V 
*/ 
*/ 
*/ 

*/ 
*/ 
*/ 



^BITS *comp_arrayuCP; /* ptr to comp_arrayUC of sig v 



COUNT num_sign_change; 



COUNT value; 



/* data 

/* counts number of sign 
/* changes 
/* for general use temporary */ 
/* storage 



*/ 
*/ 
*/ 
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COUNT byte_part; /* 1 if upper 4 bits of byte, */ 

/* and 2 if lower 4 bits of */ 
/* byte 

TBITS byte.maskUC; /* packed byte representing */ 

/* different values */ 

COUNT sign_change[256]; /* array of sign change */ 

/* indices */ 

COUNT *sign_changeP; /* pointer to sign change */ 

/* index array */ 

COUNT sign_first_dif; /* saves sign of first */ 

/* difference */ 

COUNT current_sign; /* current value of sign */ 

BOOL unpack_halfbytes(),- /* implements main unpacking */ 

/* logic */ 

/* 

★ 

* Initializations for compressed array 
* 

*/ 

kount_bytes = O; 
comp_arrayUCP = comp_arrayuC ; 

/* 

* ^ . 

* First, get the number of items in the array that was 

* compressed. This value is stored as two bytes. The first 

* byte is the upper 8 bits of the 16 bit COUNT value for 

* numout, and the second byte is the lower 8 bits. 

★ ^ ^ 

*/ 

^numoutP = 0; 

*nuinoutP =r *coinp_arrayUCP++; 

*nuinoutP <<= 8; 

*nuinoutP = *nu:noutP ; ( *conp_arrayUCP+-) ; 
kount_bytes 2 ; 



* Get the number of sign changes 
★ ^ 
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*/ 

nun,_sign_change = (COUNT) (*coinp_arrayUCP++) • 
++kount_bytes ; 

/* 



* null '^l condition, if the value read in for the"" 
number of s.gn changes is >= 256, then an error has 

* occurred. ' 



*/ ■ 



if (nuin_sign_change < o ; ; nuin_sign_change >= 256) 

fprintf(stderr, "VnXnUNCMPSlG - number of sign 

changes out " ) ; 
fprintf(stderr, "of range (= %d) is >= 256\n\n", 

nuin_sign_change) ; 
return, NO,, /« return on ..tecticn v 





V 



initializations for unpacking. 
byte_parr = i ; 



• unpao: the sign the first value of the ori,"'na"r 

* uncompressed data. •^^ginai, 



*/ 



unpack_haifbytes(&value, &byte part ^hv.- 

' "'^•i"-s_parr , &byte_maskUC, 

&coinp_arrayUCP, &kount bytes) • 
if (value == o) - i i . 



current_sign = -i; 

if (value == i) 
current_sign = i; 
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/* 



*. 



* Error condition since, value != 0 (for minus sign) and also 

* != 1 (for plus sign). 

5 * 

*/ 

fprintf (stderr, "\n\nUNCMPSIG — initial sign value ") ; 

fprintf (stderr, "is inconsistent (= %d)\n\n", value); 

return(NO); /* return on ERROR detection */ 

10 ) 

/* 

★ 

* Unpack the sign of the first difference value of the 

* original, uncompressed data. 

15 

V 

unpack^half bytes (&value, &byte_part, &byte_inaskUC, 
&coinp_arrayUCP , 
&kount_bytes) ; 
20 if (value == O) 

sign_first_dif = -l; 
else if (value == l) 

sign_f irst_dif = i; 

else 

25 { 
/* 

★ 

* Error condition since value != o (for ninus sign) and also 

* ! = 1 (for plus sign) . 

30 * 

V 

fprintf (stderr, "\n\nUNCMPSIG ~ initial sign value "); 

fprintf (stderr, "is inconsistent (= %d)\n\n", value); 

return (NO); /* return on ERROR detection */ 

35 ) 

/* 

* 

* Unpack sign change positions (where successive difference 
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" * values have different sign) . 



* — 
*/ 



for (i = 0; i < num_sign_change ; ++i) 

{ 

unpack_halfbytes(&value, &byte_part, &byte_inaskUC, 
&comp_arrayUCP, &kount_bytes) ; 

if (i == 0) 

sign_change[0] = value; 

else 

sign_change[i] = sign_change[ i - 1] + value; 
^ /* end of loop to unpack sign */ 

/♦change position values */ 



* Unpack differences from bytes and read from disk. 

★ ^ 

V 

sign_changeP = sign_change ;/* pointer to sign change */ 

/♦index array 
for (i = 0; i < *nuinoutP; ++i) 

{ 

unpack_halfbytes( lvalue, &byte_part, &byte_maskUC, 

&comp_arrayUCP, &kount_bytes) ; 
if (i > 1 && nuin_sign_change > o && i « *sign_changeP) 

current_sign = -current_sign; 
— num_sign_change; 
++sign_changeP ; 

) 

if (i == 0) 

arrayrii = current_sign * value; 

else if (i == i) 

{ 

sign_cha ? = sign_change; 

array[i; = array[i - i] + sign_f irst_dif * value; 

current_. gn = sign_f irst_dif ; 

) 
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else 

array[i] = array [i - 1] + current_sign * value; 
) /* end of loop to unpack and */ 

/* reconstruct data */ 

/* 

★ — . 

* Check for inconsistency in specified size (in bytes) of the 

* input compressed array (comp^array) versus the actual number 

* of bytes used in the uncompress process to create the output 

* uncompressed array. 

★ « . ^ ^ 

if (kount_bytes != num_comp_bytes) 

( 

fprinrf (stderr, '•\n\nUNCMPSIG — kount^bytes (= %d) 

i = " , kount^bytes ) ; 
fprintf (stderr, "num_comp_bytes (= %d)\n\n", 

num_comp_bytes) ; 

return (NO) ; 
) 

V 
/* 

* 

^ this point is reached the uncompression process has 

* proceeded normally and no errors have been detected, so 

* return with a "successful completion" status flag. 
-k ^ « ^ ^ 

*/ 

return ( YES) ; 

/★ 

^ /* end of function UNCMPSIG2 */ 

UNPACK^HALFBYTES . C 



* Function to unpack and reconstruct a COUNT value from half 

* bytes. 

* pack_half byres rerurns: 1 (i.e. YES or TRUE) for 

* successful unpacking 

* 0 (i.e. NO or FALSE) if 
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* 



there is an error 



*/ 

BOOL unpack_halfbytes(valueP, byte_partP, byte_maskUCP, 
comp_arrayUCPP, kount_bytesP) 



COUNT *valueP; 

COUNT *byte_partP; 

TBITS *byte_maskUCP; 
TBITS **coinp_arrayUCPP; 
COUNT *kount_bytesP; 

{ 

UTINY tempUC; 
UTINY valueUC; 
BOOL get_4_bits() ; 

BOOL set_4_bits() ; 



/* contains uncompressed */ 
/* value to be returned to */ 
/* main program */ 
/* 1 if upper 4 bits of byte,*/ 
/* and 2 if lower 4 bits of */ 
/* byte 

/* packed byte representing 
/* different values 



*/ 
*/ 
*/ 



/* ptr to comp_arrayUC of sig.*/ 



/' 



/* data 
/* kounts bytes used in 
/* coinp_array 

/♦temporary value 

/* temporary value 

/* function to unpack values 

/* from the upper or lower 4 

/* bits of a byte 

/* function to pack values 

/* into the upper or lower 4 

/* bits of a byte 



*/ 
*/ 
*/ 

*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
V 
*/ 



* unpack 4 bits. When the value is < 14 , it corresponds to a 

* legitimate difference value, and hence the next array value 

* can be calculated by adding the difference (with the correct 

* sign) to the preceding array value. If the value == 14 , the 

* absolute difference value is >= 14 and <= 29, hence another 

* half byte must be read in to compose the absolute 

* difference, if the value « 15, the absolute difference 

* value is >= 30 and < 269, hence another full byte (i.e., 2 

* half bytes) must be read in to compose the absolute 

* difference. 



^O'-^/O^SSa PCr/US93/06883 

39 



*/ 

if (*byte_partP == i) 

{ 

++(*kount_bytesP) ; 

*byte_raasJcUCP = **coinp_arrayUCPP; 

++(*coinp_arrayUCPP) ; 

) 

get_4_bits(&valueUC, *byte_partP, *byte_inaskUCP) ; 
*byte_partP = 3 - *byte_partP; 

if (valueUC < 14) 

*valueP = (COUNT) valueUC; 
else if (valueUC == 14) 

( 

if (*byte_partP == i) 

( 

■*byte_mas}:UCP = **coinp_arrayUCPP; 
++ (*comp_arrayUc;pp) ; 
++ ( *kount_bytesP) ; 

) 

get_4_bits(&valueUC, *byte_partP, *byte_maskUCP) ; 
*valueP = (COUNT) (valueUC + 14); 
*byte_partP = 3 - *byte_partP; 
) 

else if (valueUC == 15) 
{ 

if (*byte_partP == i) 

{ 

*byte_maskUCP = **coinp_arrayUCPP; 
++(*conp_arrayUCPP) ; 
++ (*kount_bytesP) ; 

) 



get_4_birs(&t6inpuc, *byte_partP, *byte_maskUCP) ; 
set_4_bits(tempUC, l, SvalueUC) ; 
*byte_partP = 3 - *byte_partP; 
if (*byre_partP == i) 

( 
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*byte_niaskUCP = **c,onp_arrayucpP; 
++ ( *coinp_arrayUCPP) ; 
++(*kount_bytesP) ; 

) 

^ get_4_bits(&teinpUC, *byte nartP 

.1 u-^ ,^ °yte_partP, *byte_niaskUCP) ; 

set_4_bits(ten.puc, 2, &valueUC) ; 
*valueP = (COUNT) (valueUC + 14) • 
*byte_partP = 3 - *byte_partP; 

10 else 

( 

*byte_n,askUCP = **con,p_arrayucpP; 
++(*comp_arrayuCPP) ; (*kount_bytesP) ; 

return (YES) ; /* , 

/* ^ ^^^""^ unpacked successfully 



15 



V 



/* end Of function UNPACK^halfbytes */ 

20 /* GET_4_BITS.C 



ung to 



* Function that returns the UTINY valu. . 

* Pi>h^>>- ^-u ^^i.rix value correspond!' 
either the upper 4 bite: 4-1. • f^naj.. . 

2= • The input arguement ..position., d.tl ■ ° *° 

* or lo..er 4 bits is t.L: >:pper 

* 

* position =1 

upper four bits (or upper 
3 0 * "nybble") 
position = 2 — 1 

f°ur bits (or upper 
^ "nybble") 



X / 



^include <stdio.h> 
35 =include <cic.h> 



UTINY *valueUCP; _ . . . 

■o this */ 



/* set the four bits tc 



/* value 

*/ 



wo 94/03853 



PCT/US93/06883 



41 



COUNT position; 
TBITS byte_maskUC; 

{ 



/* 1 specifies the upper 4 */ 
/* bits, and 

/* feature mask for template */ 
/* selection 2 specifies the */ 
/* lower 4 bits 





* Return original mask if error condition 


found 






if (position != 1 position != 2) 
return (NO) ; 

* 




* Get 4 bits (or nybble) specified by the 

* "position" 


input argument 





20 



if (position == i) 

byte^maskUC >>= 4. 
*valueUCP = (UTINY) (byte_maskUC & Oxf ) ; 

/* zero the upper four bits 



return (YES) ; 



25 /*-. 
*/ 

V 



/* end of function GET_4_BITS. C*/ 



30 



35 



* SET_4_BITS.C 


* Function that sets either 

* byte, or the lower 4 bits 

* specified by the input argument "value" 



the upper 4 bits in the input 
to a value in the range o to is 
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'position = ■ 1 



upper. four bits (or upper 
* "nybble") 



★ 
★ 



position =2 ~ lower four bits (or upper 

"nybble") 



* NOTE: function returns a boolean YES for normal operation 

* and NO if an error is detected. 

* 

V 

^include <stdio*h> 
ifinclude <cic.h> 

BOOL set_4_bits(valueUC, position, byte_inaskUCP) 

UTINY valueuc; /* set the four bits to this */ 

/* value *y 
COUNT position; /* specifies the position of */ 

/* the two bits ★ / 

TBITS *byte^naskUCP; /* feature mask for template */ 

/* selection */ 

/* 

★ ^ 



Return original mask if error condition found 



if (valueuc > (UTINY) 15) 

return (NO) ; 
if (position != i' &f< position != 2) 

return (NO) ; 

/* 

* Set bits as specified by the input arguments 

— — — — — — — — — — — » 

*/ 

if (position == i) 

( 

*byte_niaskUCP &= " (OxfO) ; 

/* zero upper 4 bits inbyte_inaskUCP*/ 
*byte_inaskUCP |= (valueUC << 4); 



0 
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/* ser upper 4 bits to new value */ 

else 

{ 

*byte_inaskUCP &= "(Oxf); 

/* zero lower 4 bits in byte_inaskUCP*/ 
*byte_maskUCP |= valueUC; 

/* set upper 4 bits to new value */ 

return (YES) ; 

/* 

^ /* end of function SET 4 BITS * / 

/* - - / 

— — — — — — — 

* This file contains standard type definitions, defined 

* constants, macros, and data structures. 
* 

#define FAST register 
=define GLOBAL 

#define IMPORT extern ' 

=define THISFL extern 

^define INTERN static 

^define LOCAL static 



typedef void 
typedef char 
typedef short 
typedef int 
typedef long 



VOID; 

TBOOL, TINY, TEXT; 
COUNT, SHORT; 
INT, ARGINT, METACH ; 
LONG ; 



typedef unsigned char UTINY, TBITS, UTEXT; 

typedef unsigned short BITS, UCOUNT, USHORT, SI2ETYPE; 

rypedef unsigned int BYTES, UINT; 

typedef unsigned long ULONG, LBITS ; 



typedef float FLOAT; 
typedef double DOUBLE; 
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typedef USHORT ERCTYPE; 

typedef enum 

{ 

5 NO = 0, 

YES = 1 
) BOOL; 

typedef enum 

0 { 

SUCCEED = 0, 

FAIL = 1 

] SUCCESS; 

> =ifndef abs 

??define abs(x) (((x) < O) ? -(x) : (x) ) 

??endif 

#ifndef max 

#define max(x,y) ( { (x) < (y) ) ? (y) : (x)) 
" ==endif 

#ifndef min 

. #define min(x,y) (((x) < (y) ) ? (x) : (y) ) 
j^endif 

=define FOREVER while (YES) 



#def ine EOS { ' \0 • ) 
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WE CIAIM: 



1. A method for electronically compressing 
handwritten data expressible as a sequence of data points, the 
method comprising the steps of: 

capturing a parametric representation of data 
handwritten on a digitizer tablet; 

determining local extrema and saving representations 
of said local extrema as first data points; 

determining stroke endpoints and saving ' 
representations of said stroke endpoints as second data points; 
and 

calculating curvature locally about selected data 
points between said second data points and saving only selected 
representations of said selected data points based on 
curvature. 

2. The method according to claim 1 wherein said 
parametric representation of the digitizer data is a sequence 
of X and y coordinate values in a Cartesian coordinate system 
as a function of time. 

3. The method according to claim 1 wherein said 
local extrema determining step comprises selecting minimum x^ 
and y^ and maximum x^ and y^ according to the following 
relationships: 



minimuni = 


H < 


Xi+i and Xi 


< 


^i-1' 


maximuin = 


H > 


Xi+i and Xi 


> 




minimum = 


Vi < 


Vi+l and y^ 


< 


yi'i' 


maximum = 


Vi > 


Vi+l and y^ 


> 




Xj^ and 


over 


the range i 




0 to i 



where 



discrete representation of a stroke, and 
nsamps is the number of points. 

4. The method according to claim 1 wherein said 
stroke endpoint determining step comprises selecting a first 



0 



wo 94/03853 PCr/US93/06883 

46 

'^endpoint of a stroke, Xq and y^, and selecting a last endpoint 

of a stroke, x__-_,„^ , and v 

nsamps-l J^nsamps-l' 

where 

and over the range i = 0 to i = nsamps-1 is the 
5 discrete representation of a stroke, and 
nsamps is the number of points. 

5. The method according to claim 1 wherein said 
curvature determining step comprises calculating an angle 
between adjacent data points according to the relationkhip: 

" ^i+d ~ ^i-d' 

where 

= - 3 60 degrees if > 18 0 degrees, 
= + 360 degrees if < -180 degrees, 
^i+d = 3^=tan (Xi^^ - Xi, y^^^ - y.), 
^i-d = ^^ctan (Xi - Xi.^, y. - y ,_^) , 
d is an integer no larger than 4, 

x^ and y^ over the range i = d to i = nsamps-l-d is 
the discrete representation of a stroke, and 
nsamps is the number of points. 

6. The method according to claim 5 wherein said 
curvature angle calculating step further ccr.prises discarding 
data points with a curvature angle less than 5 degrees or 
greater than 25 degrees. 

7. The method according to claim 5 wherein said 
curvature angle calculating step further comprises discarding 
data points with a curvature angle less than 25 degrees or 
greater than 50 degrees. 

8. The method according to claim 1 further 
including a preprocessing step of discarding duplicate data 
points according to the following relationships: 

if >ii = and y. = y._^, 

then discard x^ and y^ ; 
if - Xi_^ and y. . y._^, 
then save x^ and y^ ; 
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'Where 

and over the range i = 1 to i = nsamps-l is the 
discrete representation of a stroke, and 
nsainps is the number of points. 

9. The method according to claim l further 
including a preprocessing step of discarding data points 
according to the following procedure: 

set j = 1, 

calculate d = { (Xq - Xj)2 + (y^ _ y.)2^h^ 

If d < 5, then discard point corresponding to index 

j, set j = j +1, and recalculate d; 

If d > 5, then shift remaining points down so that 

the point (Xj, yj) corresponds to (x^, y^) , and quit the 

calculation, 

where 

and y. over the range i = o to i = nsamps-l is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

10. The method according to claim 1 further 
including a preprocessing step of discarding data points 
according to the following procedure: 

set j = 2, 

calculate d = < (x. -v \2, 

^ ^ -^nsamps-l ^nsamps-j) ^ 

^^nsamps-i'^nsaraps-j ) ^ ' ' 
If d < 5, then discard point corresponding to index 

j, set j = j + 1, and recalculate d; 

if d > 5, then shift remaining points down so that 

uhe point (x„sgn,pg_j, yngamps-j ) corresponds to (Xnsaiiips-2 ' 

ynsamps-2) quit, the calculation. 

11. The method according to claim l further 
including a preprocessing step of dividing data points between 
said first data points into groups of three adjacent points and 
calculating the median value of said group according to the 
following relationships: 

x. = median value of [x^,^, Xj , x.^^j, and 
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"" yj = median value of {yj.^, yj , yj + i). 

where 

Xi and y . over the range i = 1 to i = nsainps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

12. The method according to claim i further 
including a preprocessing step of dividing data points between 
said first data points into groups of three adjacent data 
points and calculating a representative value for said group 
according to the following relationships: 

Xj = 0.25Xj_^ + o.5Xj + 0.25Xj+;^, and 
yj = 0.25yj.^ + o.Syj + 0.25y.^^. 

where 

x^ and y . over the range i = 1 to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

13. An apparatus for electronically compressing 
handwritten data expressible as a sequence of data points, the 
apparatus comprising: 

means for capturing a parametric representation of 
data handwritten on a digitizer tablet; 

means for determining local extrema and saving 
representations of said local extrema as first data points; 

means for determining stroke endpoints and saving 
representations of said stroke endpoints as second data points; 
and 

means for calculating curvature locally about 
selected data points between said second data points and saving 
only selected representations of said selected data points 
based on curvature. 



14. The apparatus 
parametric repress ation of 
of X and y coordi: -3 values 
as a function of t-..-ue. 



according to claim 13 wherein said 
the digitizer data is a sequence 
in a Cartesian coordinate system 
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Id. The apparatus according to claim 13 wherein said 
local extrema determining means comprises means for selecting 
minimum and and means for selecting maximum x, and y 
according to the following relationships: ^ 

minimum = x^ < x^^^ and x^ < x^.^, 

maximum = x^ > x^^^ and x^ > x^.^, 

minimum = y. < y._^^ and y. < y . _^ , 

maximum = y. > y.^^ and y. > y. 

where 

Xi and y^ over the range i = 0 to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

16. The apparatus according to claim 13 wherein said 
stroke endpoint determining means comprises means for selecting 
a first endpoint of a stroke, x^ and y^, and means for 
selecting a last endpoint of a stroke, x^^^^^^ , and y 
where nsamps-l i^nsamps-i' 

Xi and y. over the range i = 0 to i = nsamps-l is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

17. The apparatus according to claim 13 wherein said 
curvature determining means comprises means for calculating an 
angle between adjacent data points according to the 
relationship: 

" ^i+d " 

where 

Ci = - 360 degrees if > 180 degrees, 

= + 3 60 degrees if < -180 degrees, 
ai,d = arctan (x^^^ - x^, y.^^ - y . ) , 
ai.d = arctan (x^ - x^.^, y^ - y , 
a IS an integer no larger than 4, 

Xi and y. over the range i = d to i = nsamps-l-d is 
.he discrete representation of a stroke, and 
nsamps is the number of points. 
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18. The apparatus according to claim 17 wherein said 
curvature angle calculating means further comprises means for 
discarding data points with a curvature angle less than 5 
degrees or greater than 25 degrees. 

19. The apparatus according to claim 17 wherein said 
curvature angle calculating means further comprises means for 
discarding data points with a curvature angle less than 25 
degrees or greater than 50 degrees. 



20. The apparatus according to claim 13 further 
including a preprocessing means for discarding duplicate data 
points according to the following relationships: 

if Xi = and = y._^, 

then discard x^^ and y^; 
if • x^.T and y. ^ y._^, 
then save x^^ and y^; 

where 

Xi and y^ over the range i = i to i = nsamps-l is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

21. The apparatus according to claim 13 further 
including a preprocessing means for discarding data points 
according to the following procedure: 

set j = 1, 

calculate d = { (x^ - x j ) 2 + (y^ . y.j2jb, 

If d < 5, then discard point corresponding to index 
J, set j - j + 1^ and recalculate d; 

if d > 5, then shift remaining points down so that 
the point (Xj, y.) corresponds to (x,, y,) , and quit the 

calculation, 
where 

Xi and y over the range i = o to i = nsamos-l is the 
discrete represent: ;-ion of a stroke, and 
nsamps is the number of points. 
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" 22. The apparatus according to claim 13 further 
including a preprocessing means for discarding data points 
according to the following procedure: 
set j = 2, 

calculate d = { (x^,, , -v . i 2 . 

^ ^^nsamps-l ^nsamps-j' 

(ynsainps-i"ynsainps- j ^ ^ J ' 
if d < 5, then discard point corresponding to index 

j, set j = j + 1, and recalculate d; 

if d > 5, then shift remaining points down so that 

the point (Xnsa^pg.j, Ynsamps-j) corresponds to 

ynsamps-2) ^""^ ^"i^ the calculation. 

23. The apparatus according to claim 13 further 
including a preprocessing means for dividing data points 
between said first data points into groups of three adjacent 
points and means for calculating the median value of said group 
according to the following relationships: 

Xj = median value of {Xj.;^, x j , Xj^^}, and 
Yj = median value of (Yj.^, y . , y^ + j_) . 

where 

Xi and over the range i = 1 to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

24. The apparatus according to claim 13 further 
including a preprocessing means for dividing data points 
between said first data points into groups of three adjacent 
data points and means for calculating . a representative value 
for said group according to the following relationships: 

Xj = 0.25Xj.^ + o.5Xj + 0.25Xj^^, and 
Yj = 0.25yj_^ + o.SYj + 0.25y.^^. 

where 

Xi and y^ over the range i = 1 to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 
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